When Will It Be Done? Machine Learner Answers to the 300-Billion-Dollar Question

نویسنده

  • Gary D. Boetticher
چکیده

W hen will it be done? " Senior managers will ask their software project managers this question more than 250,000 times this year. Corporations, which collectively commit over US$300 billion annually toward new software project initiatives, 1 will want to know the answer. However, when you consider Barry Boehm's claim that early software life-cycle estimates vary by a factor of four (25 to 400 percent), 2 providing an accurate, reliable project estimate presents a challenge indeed. To answer the question, a project manager might resort to one of three estimation approaches: human-based, algorithmic, or machine learner-based. Managers will use a human-based approach—which includes expert judgment, analogy, and rule of thumb—87 percent of the time, algorithmic or machine learners only 13 percent of the time. 3 Considering the popularity of algorithmic models (such as Function Point Analysis) for estimating effort, machine learner use is probably much less than half of 13 percent—perhaps three or four percent. This is surprising considering recent successes in machine learner estimation. In three instances, machine learners produced project effort estimates within 25 percent accuracy at least 75 percent of the time. The high-water mark of these efforts produces estimates with 25 percent accuracy 83 percent of the time. 4 All three cases greatly exceed Boehm's factor-of-four expectation. I contend that a greater application of AI, through machine learners , in software estimation would greatly improve accuracy, reliability, and repeatability in software development. To standardize results, I define two benchmarks for the case studies: • Prediction accuracy in project estimation refers to how often a machine learner can achieve a result within a specified range. The informal standard in effort estimation is 25 percent accuracy, denoted as pred(0.25). So, a result of pred(0.25) = 75 percent means the machine learner model produces results with 25 percent accuracy 75 percent of the time. • Mean magnitude of relative error is the average difference between actual and calculated results in absolute terms. Many consider an MMRE below 25 percent a good model and below 50 percent a usable model. All three cases assume an accuracy rate of pred(0.25) and build their machine learner-based models using data extracted from industrial settings , as opposed to deriving models from simulations. This reinforces these results' real-world applicability. development effort by applying a back-propagation neural network model to the Australian Software Metrics Association data set. 5 You can obtain the ASMA data set …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Worldwide Development of Wind Energy and CO2 Emission Reduction

One of the most considerable issues in the global society is reducing CO2 emissions of industrial activities and its consequences. Therefore, exploitation of clean energy sources is one of the requirements of the international community. In this regard, the researchers investigated the impact of renewable energy utilization to achieve the low carbon economy and concluded that it can significant...

متن کامل

Ethical questions have no moral answers Evidence that "ethic" based on "metaphysical foundations"

"Is ethic based on metaphysical principles?" Hillary Putnam and Kai Nielsen answer to this question is negative; On the contrary, from the point of view of this article, the answer to this question is positive. Many scholars of Islamic ethic have also given an implicit positive answer to this important question. One of the important evidences that shows that ethic is based on metaphysical princ...

متن کامل

ارائه روشی پویا جهت پاسخ به پرس‌وجوهای پیوسته تجمّعی اقتضایی

Data Streams are infinite, fast, time-stamp data elements which are received explosively. Generally, these elements need to be processed in an online, real-time way. So, algorithms to process data streams and answer queries on these streams are mostly one-pass. The execution of such algorithms has some challenges such as memory limitation, scheduling, and accuracy of answers. They will be more ...

متن کامل

A Computational Method of Complexity of Questions on Contents of English Sentences and its Evaluation

This paper describes a computational method of complexity of a question for an adaptive question and answer function in an intelligent support system for English learning, and its evaluation. To realize adaptive question and answer, systems should generate questions depending on both educational intentions and learner's understanding state. For generating suitable questions for a learner automa...

متن کامل

Teacher Wait-Time and Learner Initiation: A Single Case Analysis

The prevailing pattern of classroom interaction is a tripartite exchange structure known as IRF (teacher initiation, student response, teacher follow-up/feedback; Sinclair & Coulthard, 1975). Although it has its own contributions to classroom discourse, it has been criticized on several grounds, particularly for affording minimum learner participation opportunities (Kasper, 2001). An alternativ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEEE Intelligent Systems

دوره 18  شماره 

صفحات  -

تاریخ انتشار 2003